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ABSTRACT: Zero-crossing or feature-point based stereo algorithms can, by definition, determine 
explicit depth information only at particular points in the image. To compute a complete surface 
description, this sparse depth map must be interpolated. A computational theory of this interpolation 
or reconstruction process, based on a surface consistency constraint, has previously been proposed. 
In order to provide stronger boundary conditions for the interpolation process, other visual cues 
to surface shape are examined in this paper. In particular, it is shown that, in principle, shading 
information from the two views can be used to determine the orientation of the surface normal 
along the feature-point contours, as well as the parameters of the reflective properties of the 
surface material. The numerical stability of the resulting equations is also examined. 
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1. Introduction 


1.1. The Global Problem 

One of the goals of a vision system is to compute the three-dimensional shape of the 
visible surfaces in a scene. The human visual system uses many cues to compute surface shape, 
with different modules of the system using varying sources of information in the images to 
infer information about surface shape. Examples include motion analysis, stereopsis, shading, and 
texture. How do these different visual modalities contribute to the computation of surface shape? 

One computational approach to understanding the human visual system, pioneered by Marr 
and Poggio (see, for example, [Marr, 1976, 1982; Marr and Poggio, 1977]), views tine computation 
of surface shape from images, in part, as a collection of transformations between two main 
representations. The first representation is the primal sketch , which makes explicit loci of changes 
in uriage irradiance at particular scales of resolution; the second is the 2^-D sketch , which makes 
explicit information about surface shape and reflective properties of the surface material. The 
modules that compute information feeding the 2^-D sketch from the primal sketch have generally 
been considered to a first approximation to be independent of one another. It is clear, however, 
that within the 2J-D sketch the different sources of information should interact, both to maintain 
consistency among tire data provided by different modules, and to provide feedback to the modules 
in order to enhance the acquired data (for example, texture contours can facilitate stereopsis 
by driving vergence eye movements [Kidd, et al., 1979]). In this paper, we are interested in 
examining interactions at the level of the 2^-D sketch between modules of the early visual system. 
In particular, we will investigate some of the ways in which shading information can augment 
stereo data. 

1.2. The Motivating Problem 

The goal of the 2^-D sketch [see, for example, Marr, 1978, 1982] is to compute surface 
parameters, in particular, the distance to and orientation of small patches of the visible surfaces, 
the discontinuities in those surfaces (for example, the edges of objects), and possibly the properties 
of the surface material (for example, the amount of specularity, the colour and the albedo of 
the surface material). Representations similar to the 2^-D sketch have also been suggested by 
Horn [1979] and Barrow and Tenenbaum [1979], As mentioned, the input to this representation is 
provided by a number of roughly independent modules, two important ones of which are stereopsis 
and motion perception. The input provided by these particular modules, which is characteristic 
of many early visual modules, consists of explicit information about surface shape or disposition 
only at particular points in the image. 

This follows from the form of the primal sketch. Both psychophysical investigations [for 
example, Attneave, 1954; Barlow, 1961] and computational studies [for example, Roberts, 1965; 
Hcrskovitz and Binford, 1970; Horn, 1973; Marr, 1976; Marr and Hildreth, 1980, and reviews 
by Roscnfeld and Kak, 1976; and Pratt, 1978] of early visual processing suggest that most of 
the information in an image is carried by the intensity changes, and hence, that the first stage 
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of analysis of images is the detection of such changes. These changes in intensity are used as 
input by the modules that feed the 2^-D sketch and hence explicit information is obtained only 
at those locations. For example, in stereo computations [Marr and Poggio, 1979; Grimson, 1981; 
Baker and Binford, 1981; Mayhcw and Frisby, 1981] the zero-crossing contours of the primal 
sketches [Marr and Hildreth, 1980] for the left and right eye are placed in correspondence, and 
die difference in projection of the corresponding contours in the two eyes is used to determine the 
depth of the surface along that contour. To create a complete surface representation it is necessary 
to interpolate between die known values of the raw 2|-D sketch. An initial theory of this process 
that implicitly takes into account some of the shading information available in the primal sketch 
has been proposed [Grimson, 1982a, 1982b; see also refinements by Terzopoulos, 1982], 

While this interpolation or reconstruction theory constructs the “best” surface to fit through 
the known dcpdi points provided by stereopsis, it is clear that, in principle, supplying additional 
constraints to the reconstruction process would result in an improved accuracy in the computed 
surface shape. 

An extreme example of diis can be seen in Figure 1, which shows one possible set of depth 
constraints obtained by stereoscopicaily viewing a sinusoidal depdi grating of a uniform material 
from a particular viewpoint. The most conservative reconstruction, without explicitly taking the 
shading information into account, would be the plane shown in Figure lb. On the other hand, 
if surface orientation information were also available along the depth contours, the more correct 
reconstruction of Figure lc would be obtained. Thus, the motivation is to find algorithms for 
computing surface orientation at the known depth points provided by stereopsis. 

1.3. The Specific Problem 

Given that we want to augment the boundary conditions supplied to the reconstruction 
process, the specific question to be investigated here is whether shading information along the 
zero-crossing contours of the primal sketch can be used to provide information about the surface 
orientation along those contours. 

The traditional approach to the shape-from-shading problem, pioneered by Horn [1970, 1975], 
has been monocular. The goal has been to extract explicit surface orientation values at each point in 
the image, from a single view of the scene. As stated, the problem is considerably underconstrained 
and thus additional information is required in order to obtain the surface orientation information 
(see for example [Ikeuchi and Horn, 1981; Bruss, 1981; Brooks, 1982]). For example, it is usually 
assumed that the direction and strength of the illuminant and the reflective properties of the 
surface material (which are assumed constant over large sections of the image) are known. One 
method for obtaining the additional information necessary to solve for the surface orientation, 
under these assumptions, is to use the technique of photometric stereo [Woodham, 1978, 1980, 
1981; Ikeuchi and Horn, 1979; Silver, 1980], In this case, multiple images obtained from a single 
viewpoint, but with different illuminant directions, are used to determine the surface orientation 
at points in the image. 

The intention in this paper is to solve (at least partially) the shading problem with fewer 


Figure 1 . Part (a) shows an extreme example of a set of depth constraints that could be obtained by 
stereoscopically viewing a sinusoidal depth grating. If shading information is not explicitly accounted for, 
the best surface reconstruction is the plane shown in part (b). On the other hand, if the shading information 
is used to obtain surface orientation information along the depth contours, the more accurate reconstruction 
of part (c) is obtained. 





requirements or assumptions about the parameters of the imaging process. To do so will require 
either additional sources of information about shading or weaker expectations on the delivered 
data, or both. Indeed, since the motivating problem is to obtain additional boundary conditions on 
die reconstruction process, we will only expect surface orientation information at the zero-crossings, 
rather than everywhere in the image. We will further only expect a coarse estimate of the surface 
normal, since the values will serve as boundary conditions for a surface approximation process, 
rather than constituting an explicit surface reconstruction in itself. Moreover, we will introduce 
an additional source of information to the process. In particular, we will investigate the added 
information provided by the differing shading information observed at nearby viewpoints. Hence, 
we are concerned with binocular shading , especially as it applies to the surface reconstruction 
problem. The main questions to be addressed here are: (1) Using binocular shading, under what 
conditions can we solve for the surface orientation at the zero-crossing contours that have been 
matched by stereopsis, and (2) How numerically stable is the resulting process? We will see that 
the method used to solve for the surface orientation is essentially an alternate form of photometric 
stereo, using images obtained from different viewpoints, rather than with different illuminants. 

2. The Basic Equations 


The brightness recorded at a point in an image can generally be considered as a product 
of three factors, the total amount of incident light striking the surface, the percentage of such 
incident light which is reflected (as opposed to absorbed or transmitted), usually denoted by the 
albedo, p, of the surface, and the reflectance of the surface, the distribution of the reflected light as 
a function of direction. Here, we shall assume that the intensity of the incident light is normalized, 
by incorporating it into tire definition of p. As a consequence, the parameter p is not restricted to 
the range 0 — 1, but rather can take on any positive value. 


Using a right-hand coordinate system with origin at the left eye, x axis connecting the two 
eyes and the negative z axis pointing straight ahead, a surface may be represented by a Monge 
patch 

K y,—f{x,y)} 

and the surface normal at a point, by 

{p, 9,1} 

where 


P = 


df 

dx 


q = —. 
dy 


The situation to be investigated here is one in which the illumination geometry consists of an 
arbitrary surface illuminated by a point source, sufficiently distant from the surface relative to the 
distance to the viewer. Under most circumstances (for example, tire surface reflectance is isotropic), 
the reflectance properties of the surface can be combined with the illumination geometry and the 
viewer position into the convenient representation of a reflectance map [Horn, 1977], 

For an arbitrary surface, the surface reflectance map will generally be a combination of two 
effects. The first is a diffuse or matte component, which results from the scattering of light that 


penetrates some distance into the microstructure of the surface. The second is a specular or luster 
component, which results from the mirror-like reflection of light at a smooth interface between 
two materials of different refractive index. These two components can be combined into a single 
reflectance map in the following manner (see, for example, [Horn, 1981; Blinn, 1977; Phong, 
1975]). 


Let n denote a unit vector in the direction of the surface normal, defined at a point on the 
surface, let s denote a unit vector in the direction of the point light source, and let v denote a 
unit view vector. (In general, we will consider objects to be distant relative to the viewer, so that 
v is essentially constant, and will consider objects to also be distant relative to the light source, so 
that s is essentially constant). Then the recorded brightness is given by 

i - pR( n) 

and the reflectance map is given by 


fl(n) = 


(1 — a)(n ■ s) + a{n • h} fc 


( 1 ) 


where h is a unit vector in the direction of the off specular angle, 


h — 


(s+v) 

v^yT-f (sv) 


Here, p, a and k are parameters of the particular surface, with p and fc non-negative and a ranging 
between 0 and 1. 


Note that n is a function of the surface / while s is a function of the illumination geometry. 
For our particular choice of coordinate system, the view vector for the left eye is particularly 
simple, v/„ = {0,0,1}. Now suppose that we take a second view of the surface, with vergence 
angle /? (see Figure 2). Here, the view vector is given by \ R = {— sin/3,0, cos/3}. Note that the 
vergence angle p and the illuminant direction are assumed to be known, so that the vectors 
and s are constants. 

The first term in expression (1) corresponds to a matte or Lambertian component of the 
surface. Since this component depends only on the angle between the light source and the surface 
normal (and hence is independent of v), it is identical for both views. Thus, no additional 
information will be obtained from two views in the case of a perfectly matte surface. The second 
term corresponds to a specular component of the surface, and this term, in general, does change 
between the two views, as can be seen in expression (1). 

Our intent is to use the two expressions for recorded brightnesses in the two eyes to determine 
the orientation of the viewed surface at some point. In order to compare irradiance values from the 
two views, we must be certain that the points in the images at which the irradiance measurements 
are taken are in correspondence (that is, we know which points in the two images correspond to 
the same point on the surface). In the ideal case of isolated intensity edges, matched zero-crossings 
from a stereo algorithm [for example, Grimson, 1981] are in correspondence, so by restricting our 
attention to such points, we are in principle guaranteed irradiance measurements that correspond 
to the same point on a physical surface. We should note, however, that in practice a certain amount 





r\. 




Figure 2. The geometry of two views of the surface, with the vergence angle p indicated. 

of error will be associated with the matched zero-crossings, due to uncertainty in exactly localizing 
the underlying intensity changes. This error will reduce the effectiveness of the computation. 

There are two stages to the computation of surface orientation. The first is to compute the 
surface reflectance parameters p, a and k (in some applications, these might be known a priori ). 
Once the reflectance map is known, we must then solve for the surface orientation along the 
zero-crossing contours, assuming that p , a and k are constant along these contours. More precisely, 
we solve the following problem. Let the recorded brightnesses in the left and right eyes be 

£l — p[(l — a)m + au*] 

£r — £>[(1 — a)m + av k ] 

respectively, where 


m = n • s 
i ,_ (s + vl) • n 

x/2\/l + s • v*, 

v = (S + Vfl) • n 

V2\/l + S • Vfi 

We will first solve for p, a and k over some portion of the image, and then use these values to 
solve for the surface orientation along zero-crossing contours in this region of the image. 


3. Solving for the Reflectance Parameters 

The task of determining the reflectance parameters would be considerably eased if the surface 
orientation (p, g ) were already known. We thus restrict our attention to situations in which this is 
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true. Consider a point of high curvature along a zero-crossing contour, which in die limiting case 
mi ght be a corner where two zero-crossing contours intersect. The orientation of die contour in 
the image plane, along either direction of the contour, is different (within die resolution of the 
image grid). A directional derivative taken along one direction of the contour yields a constraint 
on the relationship between p and q at this point. Since there are two different directions along 
the contour at such a point, we get two different constraints on the surface orientation, each one 
corresponding to a straight line in gradient space, defining a linear relationship between the values 
of p and q at the point. The intersection of dicse two lines uniquely defines the surface orientation 
at die corner. 

We can also perform this computation in a somewhat non-standard representation, by using 
unit normals. The representation we choose to utilize is obtained by orthonormally projecting 
die Gaussian hemi-sphere of all visible unit normals onto the plane spanned by the first two 
components of the unit normal. (Note that this is a one-to-one, onto projection, since we are 
only interested in the half of the Gaussian sphere composed of die unit nonnals corresponding 
to visible surfaces). Thus, rather tiian performing computations in gradient space, we instead use 
die components of die unit nonnals lying in die unit disk. In this case, each directional derivative 
constrains the first two components of die unit normal to lie on a half ellipse in the unit disk. The 
intersection of two such half ellipses uniquely defines the components of the unit surface normal 
at the point. 

3.1. Using Irradiancc Derivatives 

We now consider methods for determining the surface reflectance parameters at such a point. 
Initially we have diree unknowns and only two equations. To uniquely solve for the unknowns, 
we will need additional information. One possibility is to consider using derivatives of the image 
irradiances to solve for the parameters. Taking first directional derivatives of the irradiance 
equations introduces three new variables, p x ,p y {= q x ) and q y . (Note drat q x is assumed to be 
equal to p y , since we shall assume that the surfaces are at least twice continuously differentiable.) 
This gives us a total of 8 unknown variables, the three listed, as well as p, q, p, a and k. At a 
corner, we can compute the values of p and q. Furthermore, second directional derivatives along 
the depth contour yield two additional linear constraints on p x ,p y and q y , so there are only 4 
independent variables. We also have four equations, namely: 

d £l 

~ = p(l- a){m v p x + m qPy } + paku^ 1 {u pPx + u qPy } 

3El 

— = p(l — a){m p p y + m q q y ) -f paku k ~ l {u pPy + u q q y } 

-fa ~ Pi 1 ~ a ){ m pPx + m qPy } + pakv k ~ y {v p p x + v q p y ) 

OEr 

— = p{ 1 — a){m pPy + m q q y } + pakv k ~ l {v pPy + v q q y } 

\ where the subscripts L and R denote the left and right images, and the other subscripts denote 

partial differentiation. 
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3.2. Solving The Equations 


Wc now seek a closed form solution to this set of equations; that is, we seek expressions for 
determining p, a and k as functions of die image irradiance derivatives, the illuminant direction 
and the vergence angle. While it is possible to obtain such a closed form solution (for completeness, 
die solution is given in the appendix), it is not very useful for computational purposes, since it is 
numerically very sensitive, as discussed below. 

3.3. Numerical Stability 


While die appendix describes a solution to the problem of determining the parameters of the 
surface reflectivity, numerical simulations indicate that die solution is not numerically stable. This 
is illustrated by the following example, used to determine whether the image values -§-(£l — Sr) 
ar >d ^(£l — Sr) can be reliably measured from an image. 

Since a visual system is not an infinite resolution device, it is important to take this into 
account when simulating the solution to the above equations. In particular, one cannot use 
analytic expressions for terms such as -§^{£l — Sr) but rather, one must use finite difference 
approximations to such differential expressions, in order to reflect the discrete computational 
structure of the vision system. Moreover, the range of values for S L and S R should also be discrete, 
rather than continuous, since any sensor will have a finite resolution of brightness values. These 
considerations turn out to be critical in evaluating the numerical stability of the computation. 
In particular, the closed form solution for the reflectance parameters given in die appendix in 
principle provides exact values for those parameters, if infinite precision data is available. When 
discrete approximations are used, however, this exact solution is no longer possible, as is illustrated 
by the following example. 

A synthetically shaded sphere of radius 160 pixels with a total irradiance range of 8 bits (or 
256 grey levels) was constructed using equation (1) with the parameters p — 256, a — 0.8 and with 
k varying from 1 to 10. Note that this implies that recorded brightness was descretized to a finite 
precision, thereby restricting the resolution of measurements of spadal change in the difference 
between the image brightnesses. 

The ratio of interocular separation to fixation distance was set at 6:100. The total number 
of points visible to both eyes was counted, and the percentage of such points for which the finite 
differences corresponding to the above partial derivatives exceeded a threshold of 1 grey level, or 
1 part in 256, was measured. This percentage was observed to be small, no more than 4%, even 
for large values of k. This held true over a range of illuminant directions. Some examples are 
shown in Table 1. (Similar results hold for other values of a.) 




/f*N 


k 

vis 

% 


k 

vis 

% 

1 

99.9 

2.27 


i 

89.7 

2.64 

2 

99.9 

0.00 


2 

89.7 

0.08 

3 

99.9 

0.00 


3 

89.7 

0.00 

4 

99.9 

0.00 


4 

89.7 

0.24 

5 

99.9 

1.34 


5 

89.7 

1.77 

6 

99.9 

1.99 


6 

89.7 

2.43 

7 

99.9 

2.33 


7 

89.7 

3.03 

8 

99.9 

2.56 


8 

89.7 

3.42 

9 

99.9 

3.19 


9 

89.7 

3.94 

10 

99.9 

3.67 

_ 

10 

89.7 

4.36 


Table 1: Sensitivity in computing derivatives of image differences 

The table lists the percentage of points on the sphere, visible in both eyes, for which the finite 
difference approximation of the partial derivative of the change in image intensity exceeded a threshold of 
one part in 256. Here, k denotes the exponent of the specular component, and vis is the percentage of all 
possible points on the sphere which are visible to both eyes. The first two components of the unit source 
normal, p s and q s , are given by p s = 0, q s = 0 in the left columns, and p s = 0.557, q s = 0.239 in the 
right, and the vergence angle is = tan -1 j&y. 

This observation suggests that the system of equations listed in the appendix, while providing 
a solution in the ideal case, will not provide a numerically stable solution. One explanation for 
this can be seen by noting dtat any partial derivatives of the irradiancc equations will involve 
partial derivatives of the components of the surface normal. Unless the surface has a significant 
curvature at these points, the partial derivatives of the image irradiances will be small and hence 
the computation of the surface parameters will be numerically unstable. For example, if a smaller 
sphere had been used in the above example, the numerical stability of the solution to the reflectance 
parameters would be improved, since the effective curvature of the sphere would be increased. 

While using finite differences in the change in image brightness between the two views did 
not provide detectable values, it was observed that the simple image differences frequently did. 
Examples are shown in Table 2. Here, we observe that even for strongly specular surfaces (k — 10) 
the percentage of orientations with a detectable (> 1 grey-level) difference in image brightnesses 
is over 55%. Allowing for image sensor noise (for example, image differences > 3) still leaves an 
appreciable percentage of orientations with measurable differences. 




[ 


Ps = 0.557 q s = 0.239 


k 

% total 

% > 1 

% > 2 

% > 3 


i 

96 

85 

53 

25 


2 

92 

91 

63 

37 


3 

89 

89 

62 

39 


4 

89 

83 

57 

38 


5 

88 

77 

53 

36 


6 

88 

71 

49 

34 


7 

88 

66 

45 

32 


8 

88 

62 

42 

31 


9 

88 

58 

40 

29 

_ 

10 

88 

55 

38 

28 
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Table 2: Sensitivity in computing image differences. 

The table lists the percentage of points on the sphere, visible in both eyes with a recorded intensity 
of at least 5 parts in 256, for which the difference in image intensity exceeded a given threshold. Here, k 
denotes the exponent of the specular component, p = tan -1 and p s = 0.557 and q s = 0.239. The first 
column lists the percentage of points on the sphere which are visible in both eyes with sufficient brightness. 
The remaining three columns list the percentage of such points which have an irradiance difference in the 
two eyes of the amount listed. 

This suggests that while we may not be able to reliably measure the spatial derivatives of 
the difference in image brightness, due to the discrete nature of the sensing of brightness values 
and tire discrete approximations to the spatial derivatives, we may be able to rely on the simple 
differences in image brightness. We thus turn to the problem of finding a solution for tire surface 
reflectivity parameters from the image brightnesses alone. 

4. A Second Solution 

We first observe that even recording tire irradiances at a corner (defined here are a point of 
high curvature), where the surface orientation is known, is not sufficient to solve the problem, since 
we have two equations in three unknowns. We thus require an additional source of information 
and will assume that we know the surface orientation at two distinct points, (we will show later 
that these could be two nearby corners or simply one corner and a nearby point). 

At each corner, we note that the following expressions hold (where * denotes the image point 
in question) 

a . _ _ 2m,(£T — gfl). ___ 

— v k ) — (£ L — + v* — 2m t ) 

_ 1 _ 

1 _±_ (£«)<“?—(&■).•»? 

= (£l + — V?) — (£l — £R)j{uj + yf—2 m,) 

2m,K - «? ) 

_ (£l — £j?)t l 

U k _ v k a . 

This gives us a solution for p and a, presuming we know the value of k. To determine k, we 
assume that the surface reflectivity parameters are constant over the region of the image spanned 
by the two known points. In this case, = a 2 and p x = p 2 and this leads to the following 
implicit equations for k: 

{£l — _ m 2 (u k e L — v' c £r) 1 

(£l— mi(u k £ L — v k £ R ) 2 

and 

(Sl-Sr), {u k -v% 

{£l — £r ) 2 (u fc — V k ) 2 ' 

Either equation can be solved numerically to determine the value of k and hence of the other 
reflectivity parameters a and p. 








4.1. Numerical Stability 

While the equations derived in section 3 were observed to be numerically unstable, the 
above solution is much more sound. This can be observed by the following simulation. Consider 
a sphere illuminated by a point source, and observed by two sensors whose angle of vergence is 
/3 = tan - (roughly equivalent to a human observer fixating on an object 1 meter removed). 

The surface of die sphere can be given any surface reflectance properties by specifying the 

parameters k, p and a. For some set of parameters, we compute the observed brightness value in 

each sensor to one part in 256. We then apply the equations derived above (assuming initially 

that the value of k is known) to compute the parameters a and p. The accuracy of the computed 
values is compared to the original value under a number of situations. 

Since a certain amount of error will follow from the truncation of brightness accuracy, we 
measured the error in computing the parameters as a function of the difference in observed 
brightness values. It will be noted that we can trade off accuracy of computed parameters with the 
percentage of orientations for which such a computation can be made (see Table 3). In particular, 
as we increase the required lower bound on the observed differences in brightness, we decrease the 
total percentage of orientations for which a computation can be made, but increase the percentage 
of such points which give rise to computed surface parameters within a particular error range. 
Table 3 illustrates some examples. 


k 

vis 


p-1 

P-5 

P-10 


a-1 

a-5 

a-10 

i 

25 


21.9 

52.6 

62.0 


5.1 

17.8 

31.0 

2 

37 


69.7 

95.8 

98.4 


6.8 

28.5 

54.6 

3 

39 


66.3 

95.2 

99.4 


11.9 

39.8 

66.6 

4 

38 


60.1 

93.6 

99.0 


13.9 

44.2 

72.0 

5 

36 


57.7 

92.2 

98.6 


13.9 

46.1 

76.0 

6 

34 


56.4 

91.8 

98.4 


15.2 

49.3 

78.6 

7 

32 


56.4 

91.0 

98.2 


16.3 

52.4 

80.3 

8 

31 


55.3 

90.6 

98.1 


15.0 

52.7 

82.7 

9 

29 


55.4 

90.9 

98.3 


18.3 

55.5 

84.1 

10 

28 


55.2 

90,1 

98.0 


17.7 

56.8 

85.0 



k 

vis 


p-1 

p-5 

E9 

1 


a-5 



i 

0 


0.0 

0.0 

l 


■ 




2 

0 


0.0 

0.0 


1 

■ 




3 

2 


51.3 

98.0 

■ 




13.3 


4 

7 


73.2 

99.0 

■ 






5 

11 


72.5 

99.1 



■ 

52.9 



6 

13 


70.6 

98.5 


1 

■HIM 

57.6 

■a 


7 

14 


71.0 

98.8 


1 

mum 

mm 

90.1 


8 

14 


69.6 

98.6 

iifil 


16.7 

61.3 

92.7 


9 

14 


69.9 

98.4 

100.0 


22.6 

64.7 

92.7 


10 

14 


70.0 

98.5 

100.0 


21.2 

66.2 

94.7 


Table 3: Errors in computing a and p. 
The following parameters are defined: 
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Vis is the percentage of viewed orientations on a sphere for which the difference in observed brightnesses 
exceeds some threshold; 

a-e is the percentage of vis which lie within a range ±e% of the coned value of a; 
p-e is the percentage of vis which lie within a range ±e% of the correct value of p; 
e is the percentage error bar allowed about the correct value. 

In this case, the illuminant direction is given by p s = 0.557, q s = 0.239. In the first table, the 
threshold of image differences was set at 3, in the second table, the threshold was 5. 

To compute the value of k, we must solve an implicit equation in k. This can be done 
numerically, for example, by Newton-Raphson. While computer simulations indicate that this 
computation is very robust in the case of perfect image brightnesses, the computation is much 
less stable when considering truncated brightness values (i.e. if infinite precision brightness values 
are used, the computation returns very accurate values for k, but when the finite resolution of 
the brightness sensor is taken into account, the accuracy of the computation drops off quickly). 
Table 4 indicates an example of the error associated with computing k numerically. This table was 
computed by choosing two surface orientations at random, such that the difference in brightness 
exceed a threshold of 3 grey levels, and solving the implicit equation for k by a Newton-Raphson 
method. 

One possible method for improving the accuracy of computing not only k, but also p and 
a, is to consider tire reflective properties of the surface to be constant over some extended region 
of the image. In this case, a number of points may be sampled, and the computed values for the 
parameters averaged. Table 4 illustrates the reduction in error associated with computing k. 
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5% 


14.8 

15.6 

16.2 

18.6 

10% 


27.8 

30.6 

29.7 

35.3 

20% 


47.2 

50.7 

54.6 

65.3 

30% 


62.6 

65.9 

73.6 

84.5 

40% 


69.3 

77.4 

83.9 

94.7 

50% 


75.0 

85.4 

92.0 

98.5 


Table 4: Errors in computing k. 

Two different orientations were chosen at random, such that the difference in brightness values at 
each point exceeded a threshold of 3 grey levels. The percentage of such pairs of points with percentage 
error in computing k within the given range are indicated. In the case illustrated below, the illuminant 
direction was given by p s = 0.557, q s = 0.239, with p — 256, a — 0.5 and k — 5. The percentages are 
shown for the case of sampling 1, 3, 5, and 10 pairs, and averaging the results. 


5. Solving for the Surface Orientation 


Given that the equations derived in Section 4 can be used to determine the surface reflectivity 
parameters k, p and a, we turn to the final stage, which is to compute the surface orientation 
along the matched zero-crossing contours of the stereo algorithm. 





The algorithm described below is essentially a modification of the photometric stereo algorithm 
[Woodham, 1978, 1980, 1981; Ikeuchi and Horn, 1979; Silver, 1980], It can be sketched as follows. 

(1) Compute the reflectance map [Horn, 1977, 1981; Horn and Sjoberg, 1979] associated 
with the computed values of p, a and k. Since R is a function of the unit surface normal, 
it can be calculated over the unit disk, with coordinate axis given by tire first and second 
components of tire unit normal. 

(2) Use the measured value of C L at a point to determine a contour of isobrightness in the 
unit disk. This defines the set of possible surface orientations, which are consistent with the 
value of S L - 

(3) Use the measured value of S R at the corresponding point to determine a second contour 
of isobrightness in the unit disk. The intersections of tire two contours define possible surface 
orientations. Note that in general there will be at least two such points of intersection. 

(4) We can disambiguate the possible orientations by applying a third constraint. Take a 
directional derivative of depth along the contour through tire point. This will yield a linear 
constraint between p and q. When translated to the unit disk (and a constraint between the 
components of tire unit surface normal), this constraint becomes a half ellipse in the unit 
disk. The intersection of this curve with the two isobrightness contours defines the correct 
surface orientation for the point in question. 

The key question still to be considered is whether this algorithm results in a unique solution. 
We first observe that the isobrightness contours for either a matte or a specular surface consist of 
a series of nested ellipses or half-ellipses. In particular, for a given value R{n) = c, the ellipse has 
center in the unit disk of ( cp s , cq s ) where p s and q s are tire first and second components of the 
unit source vector s. The minor axis of the ellipse is oriented along the line from the origin to 
the projection of s and the major axis is perpendicular to it. 

While it is possible to derive this analytically, it more easily observed by noting that 
for a Lambertian coated sphere, with source located along the same direction as the viewer, 
the isobrightness contours are nested circles. For an off-axis light source, the corresponding 
isobrightness contours can be obtained by rotating the sphere relative to the viewer, and then 
projecting the previous isobrightness contours onto the unit disk, clearly resulting in a series 
of nested ellipses. A similar argument holds for the isobrightness contours of a purely specular 
surface. 

When considering a convex combination of two such reflectance maps, the isobrightness 
contours become somewhat more complex. In general, however, the contours are slightly distorted 
ellipses (see Figure 3). As a consequence, the intersection of isobrightness contours from the two 
eyes will in general result in a pair of points in the unit disk. (This excludes the degenerate case 
of a pure matte surface, in which case the intersection will be a complete ellipse.) We must now 
determine the circumstances under which the third constraint will disambiguate this pair of points. 

Suppose that the zero-crossing contour at tire point of interest has a tangent in the image 
plane whose angle with respect to the x axis is 7 , and whose derivative of depth along the contour 



Figure 3. Examples of isobrightness contours on the unit disk, for different reflectance maps. In all 
four cases, p s = 0.7, g s = 0.3. For cases (a), (b), and (c), the exponent k — io, and the parameter a is 
0.1,0.5 and 0.9 respectively. In case (d), the exponent is k = 5 and a = 0.5. 














Figure 4. Constraints on the unit surface normal, as a function of the component of the surface 
gradient along a contour. Each contour delineates the possible values of the unit surface normal, for a 
different value of the surface gradient along the zero-crossing contour. In each of the contours illustrated 
here, the parameter 7 was negative. The straight line corresponds to the case in which the depth derivative 
along the zero-crossing contour was zero. 

is given by die value d. This derivative determines the component of the surface normal along the 
contour. When translated into a constraint on the components of the unit surface normal, the set 
of possible unit normals which are consistent with such a component along the contour are given 
by the parametric equations (assuming 7 is non-zero) 


«iW = 


n 2 (t) = 


t sin 7 

(sin 2 7 + d 2 — 2 dt cos 7 -f- f 2 ) 2 
d — t cos 7 

(sin 2 7 d 2 — 2 dt cos 7 -M 2 )^ 


These parametric equations define a half ellipse in the unit disk with center at the origin and with 
major axis the line through the origin with slope —cot 7 of length 2. If 7 is positive, the half 
ellipse is that which connects the two endpoints of the major axis and whose half of the minor 
axis lies the the positive n 2 half disk, if 7 is negative, the half ellipse lies with its half of the 
minor axis in the negative n 2 half disk (see Figure 4). 


We wish to determine die conditions in which there is not a unique solution, which 
corresponds to determining the conditions under which the half ellipse constraint passes through 
both points arising from the intersection of the isobrightness contours. Note that the family of all 
possible depth constraints can be represented by the pairs ( 7 , b) where 7 ranges from —tt to ir 
and 6 is the length of the minor axis of the half ellipse. Consider a point ( p 0) q 0 ) lying in the unit 
disk. Let 

£ = sgn(g 0 )tan —1 —. 

Po 
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Then the valid range of possible 7 is from £ - f to £ -f §, modulo 2 tt. Thus, the set of possible 
depth constraints which pass through this point can be specified by the graph given in Figure 5, 
which has a range of * in 7 and a maximum in b of Vpo + 9 o- For a second, distinct point in 
the unit disk, a similar mapping is obtained, which intersects the first at exactly one point. This 
implies that there is exactly one ellipse, with orientation and eccentricity given by this intersection, 
which passes through both points in the unit disk. This in turn implies that die depth constraint 
will almost always uniquely specify the surface orientation of the zero-crossing point, since only 
a particular alignment of the light source and the depth contour will result in an ambiguity, and 
in general, the physical processes which give rise to tire depth contour constraint are independent 
of the light source. In the case of continuous value ranges for 7 and d, die set of conditions 
in which a unique answer is not possible will, in fact, have measure zero. In the more practical 
case of a discrete range of values for 7 and d, assume that the orientation 7 can take on one 
of n different values, while the component of the depth gradient along die contour d can take 
on one of m different values. Then assuming that the possible values for 7 and d are uniformly 
distributed, and assuming that the constraints due to die reflectance map isobrightness contours 
are independent of the constraint due to the depth derivative implies that the probability of an 
ambiguous computation of surface orientation is Clearly, if we have a reasonable resolution 
for 7 and d, diis probability will be very small. 

6. Discussion 

We note that it is possible to solve for the reflectivity parameters without requiring two 
corners. Suppose instead that we have one corner, and that the surface is at least twice continuously 
differentiable. Then by noting that integration of surface orientation around a closed loop must 
yield zero (note that a similar constraint has been used by Ikeuchi and Horn [1981]) we can 
determine die surface orientation at a second point, and the analysis of Section 4 still holds. 

We also note that the numerical properties of the algorithm sketched here probably preclude 
its use by the human visual system. In particular, to have reasonable numerical stability over a 
large range of surface orientations, a large vergence angle is required (on the order of 3 °). For the 
human system, this is roughly equivalent to viewing objects from a distance of 1 meter or less. 
In cases of machine vision, where control of the positioning of die sensors is possible (i.e. we are 
not restricted to interocular separations of 6 cm) large vergence angles are more feasible. 

7. Error Analysis 


As well as performing simulations of the computation of surface parameters, we can also 
make an estimate of the error associated with computing the parameters. This analysis will also 
indicate a means for determining diose situations in which the error is small. 

We again consider the situation in which the parameter k is known exactly, as well as the 
values of p and q and hence of m, u and v. We let 









Figure 5. Determining the uniqueness of the solution. Parts (a) and (b) represent constraints between 
the parameters b and 7 , for different values of £. When a second point is added (c), the intersection of 
die two contours defines the only possible half ellipse in the unit disk which passes through both points 
in the unit disk and has appropriate major and minor axes. Thus, given two isobrightness contours in the 
unit disk, from the two views of a point on a zero-crossing contour, there is exacdy one depth contraint 
which will not disambiguate these points. 
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— measured value for a 
Pm == measured value for p 
a 0 = actual value for a 
Po = actual value for p 
Sc = error in S L 
Sr — error in Sr. 

The goal is to determine estimates for the ratio of the measured values to the actual values, that 
is, to derive expressions for 


Qm 

a o 

Pm 
PO ' 


By expansion, we obtain the following expression 

. _( — Sr + Sl — Sr \ 1 

V u k — v k )s m 


(\ a- s L~ s n \ Po 
V s L -s R )e s - 


Thus, 


Similarly, 

“m 

so that 


e 


p 



Sr \ 1 

Sl — S R )e a 


™(£l -S r + s l — s R ) _ 

(u k — m)(£ R -f S R ) — (v k — m)(S L + S L ) 

a J i j_ 6 ‘- ~ 6r ](i _ (u fc - m)S R - ( v k - m)S L 

\ £l — £rJ\ {u k — + S R ) — (v k — m)(£ L -f Sr) 


ta _ f 1 , 6 l — S R _ (u k - m)S R - (v k - m)S L \ 

V £l — £r)\ {u k — m){£ R + S R ) — {v k — m)(£ L + S L )J‘ 

The goal is to determine what circumstances will result in small errors in the computed 
parameters (i.e. under what conditions will e p s» 1 and e a « 1). 


Algebraic substitution yields the following alternative forms: 
e —21 ^l — Sr 1 S r £ l — 6 l £ r 

P £l — Sr p m(£ L — £ B ) 

£ a = fi _|_ 6l ~ 6r Vi — s l — S r + ^(£ L S R — £ r S l ) \ 

V Sc — Sr/^ £ l £ r -j- Sc Sr -f- ^{£ l 6r — SrSc) J 

We can see that the only circumstances in which the error will be large are when \S L — S R \ is 
not significantly greater than | Sc — S R \. Since in general, |<5^ — <5 K | will be no more than 1, by 
restricting our attention to portions of the image in which \£c — £r\ is larger than some threshold, 
we can reduce the probable error in our estimation of the surface reflectivity parameters. 
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8. Summary 


We have demonstrated that shading information can be combined with stereo data along 
feature point contours in order to determine the surface orientation along those contours, under 
the following conditions. 

(1) The surface is illuminated by a distant point source, whose direction relative to the viewer 
is known. 


(2) The surface is viewed from two different positions, and the angle of vergence of the two 
views is known. For numerical stability, this angle should be on the order of 3 degrees or 
larger. 


(3) The surface is at least twice continuously differentiable. 

(4) The reflectance map of the surface can be represented in the form 


m 


(1 — a)(n • s) + a{n • h}* 


where s is a unit vector in the direction of the light source, h is a unit vector in the direction 
of die off specular angle. 


h =- £ ± - v ) ■ 

and n is a unit surface normal. The variables a, p and k are unknown, and are to be 
computed from the irradiance data. It is assumed that a is non-zero, so that the surface is 
not perfectly matte. 


Under these conditions, we have shown that using the brightness values recorded at 
corresponding points in the two images (determined by the stereo correspondences) we can 
compute the reflectivity parameters for the surface, and using in addition the component of the 
surface gradient along the stereo contour, we can apply an algorithm similar to photometric stereo 
to compute the surface orientation along the stereo contours. 

The numerical stability of the computation was also explored and methods for reducing the 
possible error in the computation were suggested. The key observation concerning the computation 
was that while accurate solutions were possible given high resolution input (e.g. the image 
brightnesses were known to extremely high accuracy), when the discrete nature of the brightness 
sensistivity and spatial sensitivity of the imaging devices was taken into account, the numerical 
stability of the computations degraded rapidly. This suggest that in its current form, computations 
such as those outlined here may be of limited practical use, until methods for improving the 
numerical stability are derived. 
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10. Appendix 


The solution for the reflectivity parameters, using derivatives of the image irradiances is 
summarized as follows. The solution to k is given by 

k = j _|_ In {(a 2 + m p a s )u q — (qi -f m g a 3 )u p } — In {(a 2 — m p a 3 )v q — (ai — m ? Q 3 )f p } 

In v — In tt 

the albedo p is given by 


2 - ) 0 ^ Sl + ~ \p c + ~ K 1 ~ V )C + (1 — Qg]g 

9 V©7 + um,)$ + [(1 — v)m v + (1 — £)m,]'I' ' 

and the specular scalar a is given by 


where 


a 


1 9 
p 9 


°i — pm p (A x T, y — AyEj) + A y E y — wA^Ej) + A y E x rn q (p£ — wv) 

°2 A X Ey AyE X ) “p ^Ip^AyEy ^AjEj) -p A a E yWlpint — lOV) 

03 = wA l + (uu — n() A z A y — fiAy 

where 

^ = (Ai + vA y )A — (A y -f £A X )B 
if — pAA y — uBA x 
0 = pA 2 + (w v — p£)AB — w B 2 

and where 

A x = ^-^ 
dx dx 

a — d£ R 

V dy dy 

dx dx 

v _ _L 

V dy + dy 

A -- k(u p u k ~ l — v p v k ~*) 

B = k{u q u k ~ l — v q v k ~ ! ) 

C = k(u p u k ~ 1 + v p v k ~ 1 ) — 2m p 
D — k(u v u k ~ 1 + v q v fc—1 ) — 2m, 

and where the second directional derivatives along the two directions of the zero-crossing contour, 
whose values are d x and d 2 with associated directions 71 and 72 , determine the constants 







d\ sin 2 72 — ^2 sin 2 71 
M sin ( 71 + 72 ) sin ( 72 — 71 ) 
d s cos 2 71 — d\ cos 2 72 

OJ — - - ---;-- 

sm( 7 i + 72 ) sin (72 — 71 ) 

_ 2 sin 71 sin 72 
sin (71 -f 72 ) 

2 cos 71 cos 72 
sin ( 7 x + 72 ) ' 

Finally, the surface orientation at this point is given by the three parameters 

A y lj.A—A x u)B 

fxy ~ 

fxx — H — vfxy 
fyy = w £fxy 

10.1. Discussion of the Equations 

Several comments on the form and derivation of the equations are appropriate. First, we note 
that in deriving the above expression for a, die form obtained is valid only in the case of both 
A* + A y and A x — A y non-zero. This provides an interesting side effect, since these expressions 
both vanish if and only if the surface material has only a matte component, and no (measurable) 
specular component, or the surface is a plane. If the surface material has no specular component, 
a — 0 and this is exactly the expression obtained by substitution of A x — 0 and A y — 0 into the 
expression obtained for a. 

Second, we note that, subject to possible numerical errors, the solution for the surface 
reflectance parameters is unique. 

Third, we note that it is important to consider the measurability of the image knowns. In 
other words, can we extract the required measurements for the images in a reliable manner? We 
first note that the zero-crossing contours, along which the stereo depth information is known, will 
frequently outline discontinuities in the surface or in the surface material, for example, occluding 
edges of objects or changes in surface albedo. Since we cannot measure infinitesimal derivatives 
in the image, does this preclude our ability to measure derivatives of the images? The answer 
is no, since we can again rely on directional derivatives. For example, we need to measure 
-§^{£l — £r) and -§^{Cl — £r)- This can be done without crossing a discontinuity in the image 
irradiances by considering the two directional derivatives along the contour obtained at a corner in 
the zero-crossing contour. At this point, we have two different directional derivatives, and simple 
algebra allows us to then determine the parametric partial derivatives. 
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